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TITLE: A cell with improved secretion mediated by MrgA protein or homologue 1 0 DEC, 2003 

Modtaget 

FIELD OF THE INVENTION 

In the industrial production of secreted polypeptides it is of interest to achieve a product 
5 yield as high as possible. Accordingly, it is highly desirable to remove any potential bottlenecks 
from the secretory machinery of production cells. To this end it is well-known that it can be 
advantageous to overexpress one or more gene(s) encoding protein(s) involved with secretion, 
e.g M PrsA protein or functional homologues thereof. The present invention relates to a cell 
which overexpresses MrgA protein or functional MrgA protein homologue. 

10 

BACKGROUND OF THE INVENTION 

The MrgA protein of Bacillus subtilis was originally classified as a Dps(PexB) 

homologue, encoded by a metalloregulated oxidative-stress gene (metallo regulated gene) 

mrgA. One purported function of the MrgA protein in 8. subtilis is to bind DNA under conditions 
15 of oxidative stress and to protect the DNA against damage (Chen L, Helmann JD. 1995. 

Bacillus subtilis MrgA is a Dps(PexB) homologue: evidence for metalloregulation of an 

oxidative stress gene. Mol Microbiol 18: 295-300). 

A B. subtilis mrgA deletion mutant only had a somewhat reduced overall level of 

secreted proteins, and it was therefore broadly concluded that MrgA is not involved in protein 
20 secretion in 8. subtilis, (van Wely KH, Swaving J, Klein M, Freudl R, Driessen AJ. 2000. The 

carboxyl terminus of the Bacillus subtilis SecA is dispensable for protein secretion and viability. 

Microbiology 146: 2573-81). 

However, the present inventors have found, as demonstrated herein, that MrgA is in 

fact involved in secretion in Bacillus, and that a higher expression of mrgA leads to a higher 
25 secretion of an exoenzyme, exemplified below by improved secretion of a heterologous alpha- 

amylase, 

SUMMARY OF THE INVENTION 

Severe secretion stress was imposed on a Bacillus subtilis cell by overexpressing an 
30 exoenzyme, the alpha-amylase AmyQ of Bacillus amyloliquefaciens, encoded by a plasmid- 
-borne constitutively expressed gene in the cell. DNA microarray analyses revealed an 
increased expression of the general stiBSS-proteia/nrgA as a response to the imposed 
"secretion stress. 

The mrgA gene was amplified by PCR from the chromosome of B. subtilis. Three PCR 
35 reactions were carried out with three different upstream PCR primers, each comprising the 
sequence of a synthetic constitutive promoter of different strength. The three amplified PCR 
fragments were integrated into the chromosome of individual 8. subtilis strains, resulting in 
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three recombinant strains, each expressing mrgA from the native locus, and from the 
integrated mrgA copy, which was transcribed from the synthetic promoter. 

The three strains overexpressing mrgA, and a corresponding control strain, were then 
transformed with the plasmid pKTHIO, which carries and constitutively overexpresses the 
5 gene encoding the alpha-amylase AmyQ of B. amyloHquefaciens. 

The yields of secreted AmyQ amylase from the transformed strains were determined 
after 1 weeks cultivation in 200 ml BPX culture flasks. For each of the three MrgA 
overexpressing strains, and the control strain, three independent isolates were analysed in 
triplicate to determine the amylase yields. The yields of secreted amylase from the MrgA 
1 0 overexpressing strains were 27% - 44% higher than the yield from the control strain. 

Accordingly, in a first aspect the invention relates to a progeny cell derived from a 
parent cell, wherein 

a) the progeny cell comprises at least one gene encoding MrgA protein or a functional 
homologue thereof and/or a DNA segment operably linked with the encoding gene, 

1 5 wherein said gene and/or DNA segment is manipulated with respect to the parent cell; 

b) the progeny cell comprises two or more copies of a gene encoding MrgA protein or a 
functional homologue thereof; or 

c) the progeny cell is mutated with respect to the parent cell; 

whereby the progeny cell produces greater amounts of MrgA protein or a functional 
20 homologue thereof than the parent cell. 

In the present context, a functional homologue of the MrgA protein is a protein, which 
when expressed at a higher level in a cell, leads to an increased secretion of an exoenzyme, 
such as an alpha-amylase, when compared with an otherwise identical cell with normal 
expression of the MrgA functional homologue cultivated under essentially identical conditions. 
25 In addition, the functional homologue of the MrgA protein shares an amino acid sequence 
identity with the MrgA protein of at least 50%. preferably 55%, 60%, 65%, 70%, 75%, 80%. 
85%, 90%, 95%, 97%, or most preferably 99% when aligned as described above. 

A second aspect of the invention relates to a method for enhancing secretion of an 
exoproteln of interest, the method comprising expressing said exoprotein in a cell according to 
30 the first aspect. 

A third aspect of the invention relates to a method for producing a cell as defined in the 
first aspect useful for production of an exoprotein of interest, said method comprising a step of 
manipulating a cell to increase the expression of MrgA protein or functional homologue thereof. 
In a fourth aspect the invention relates to a method for producing an exoprotein of 
35 interest, comprising the steps of: 

a) cultivating a cell as defined in the first aspect; and 

b) recovering the exoprotein. 
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In a final aspect the invention relates to the use of MrgA-protein or a functional 
homologue thereof in a method for enhancing secretion of an exoprotein by manipulating or 
mutating a cell to express greater amounts of MrgA protein or functional homologue thereof 
than the non-manipulated or non-mutated cell. 

DEFINITIONS 

In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the skill of the art. 
Such techniques are explained fully in the literature. See, e.g., Sambrook. Fritsch & Maniatis, 
Molecular Cloning: A Laboratory Manual, Second Edition (1989) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (herein "Sambrook et al., 1989") DNA 
Cloning: A Practical Approach. Volumes I and II /D.N. Glover ed. 1985); Oligonucleotide 
Synthesis (M.J. Gait ed. 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds 
(1985)); Transcription And Translation (B.D. Hames & S.J. Higgins, eds. (1984)); Animal Cell 
Culture (R.I. Freshney, ed. (1986)); Immobilized Cells And Enzymes (IRL Press, (1986)); B. 
Perbal, A Practical Guide To Molecular Cloning (1984). 

A "polynucleotide" is a single- or double-stranded polymer of deoxyribonucleotide or 
ribonucleotide bases read from the 5' to the 3' end. Polynucleotides include RNA and DNA, 
and may be isolated from natural sources, synthesized in vitro, or prepared from a combination 
20 of natural and synthetic molecules. 

A "nucleic acid molecule" or "nucleotide sequence" refers to the phosphate ester 
polymeric form of ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA 
molecules") or deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxythymidine, or 
deoxycytidine; "DNA molecules") in either single stranded form, or a double-stranded helix. 
Double stranded DNA-DNA, DNA-RNA and RNA-RNA helices are possible. The term nucleic 
acid molecule, and in particular DNA or RNA molecule, refers only to the primary and 
secondary structure of the molecule, and does not limit it to any particular tertiary or quaternary 
forms. Thus, this term includes double-stranded DNA found, inter alia, in linear or circular DNA 
molecules (e.g., restriction fragments), plasmids, and chromosomes. In discussing the 
structure of particular double-stranded DNA molecules, sequences may be described herein 
according to the normal convention of giving only the sequence in the 5' to 3' direction along 
the nontranscribed strand of DNA (i.e., the strand having a sequence homologous to the 
•mRNA). A "recombinant DNA molecule" is a DNA molecule that has undergone a molecular 
biological manipulation. 

A nucleic add molecule is "hybridizable" to another nucleic acid molecule, such as a 
cDNA, genomic DNA. or RNA, when a single stranded form of the nucleic acid molecule can 
anneal to the other nucleic acid molecule under the appropriate conditions of temperature and 
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solution ionic strength (see Sambrook et al., supra). The conditions of temperature and ionic 
strength determine the "stringency" of the hybridization. 

For purposes of the present invention, hybridization indicates that the nucleotide 
sequence hybridizes to a labeled polynucleotide probe which hybridizes to the nucleotide 

5 sequence shown in SEQ ID NO:1 under very low to very high stringency conditions. 
Molecules to which the polynucleotide probe hybridizes under these conditions may be 
detected using X-ray film or by any other method known in the art. Whenever the term 
"polynucleotide probe" is used in the present context, it is to be understood that such a probe 
contains at least 15 nucleotides. 

10 In an interesting embodiment, the polynucleotide probe is the complementary strand of 

a fragment of at least 15 nucleotides of SEQ ID NO:1. In another interesting embodiment, the 
polynucleotide probe is a fragment of at least 15 nucleotides of the complementary strand of 
any nucleotide sequence which encodes the polypeptide of SEQ ID NO:2. In a further 
interesting embodiment, the polynucleotide probe is the complementary strand of SEQ ID 

15 NO:1 . In a still further interesting embodiment, the polynucleotide probe is the complementary 
strand of the mature polypeptide coding region of SEQ ID NO:1. 

For long probes of at least 100 nucleotides in length, very low to very high stringency 
conditions are defined as prehybridization and hybridization at 42°C in 5X SSPE, 1 .0% SDS, 
5X Denhardt's solution, 100 Dg/ml sheared and denatured salmon sperm DNA, following 

20 standard Southern blotting procedures. Preferably, the long probes of at least 100 nucleotides 
do not contain more than 1000 nucleotides. For long probes of at least 100 nucleotides in 
length, the carrier material is finally washed three times each for 15 minutes using 2 x SSC, 
0.1% SDS at 42°C (very low stringency), preferably washed three times each for 15 minutes 
using 0.5 x SSC, 0.1% SDS at 42°C (low stringency), more preferably washed three times 

25 each for 15 minutes using 0.2 x SSC, 0.1% SDS at 42°C (medium stringency), even more 
preferably washed three times each for 15 minutes using 0.2 x SSC, 0.1% SDS at 55°C 
(medium-high stringency), most preferably washed three times each for 15 minutes using 0.1 x 
SSC, 0.1% SDS at 60°C (high stringency), in particular washed three times each for 15 
minutes using 0.1 x SSC, 0.1% SDS at 68°C (very high stringency). 

30 Although not particularly preferred, it is contemplated that shorter probes, e.g. probes 

,which are from about 15 to 99 nucleotides in length, such as from about 15 to about 70 
nucleotides in length, may be also be used. For such short probes, stringency conditions are 
^defined as prehybridization, hybridization, and washing post-hybridization at 5°C to 10°C below 
the calculated Tm using the calculation according to Bolton and McCarthy (1962, Proceedings 

35 of the National Academy of Sciences USA 48:1390) in 0.9 M NaCI. 0.09 M Tris-HCI pH 7.6, 6 
mM EDTA, 0.5% NP-40, 1X Denhardt's solution, 1 mM sodium pyrophosphate, 1 mM sodium 
monobasic phosphate, 0.1 mM ATP, and 0.2 mg of yeast RNA per ml following standard 
Southern blotting procedures. 



For short probes which are about 15 nucleotides to 99 nucleotides in length, the carrier 
material is washed once in 6X SCC plus 0.1% SDS for 15 minutes and twice each for 15 
minutes using 6X SSC at 5°C to 10°C below the calculated Tm. 

A DNA "coding sequence" or an "open reading frame (ORF)" is a double-stranded DNA 
sequence which is transcribed and translated into a polypeptide in a cell in vitro or in vivo when 
placed under the control of appropriate regulatory sequences. The boundaries of the coding 
sequence are determined by a start codon at the 5* (amino) terminus and a translation stop 
codon at the 3' (carboxyl) terminus. A coding sequence can include, but is not limited to, 
prokaryotic sequences. cDNA from eukaryotic mRNA, genomic DNA sequences from 
eukaryotic (e.g.. mammalian) DNA, and even synthetic DNA sequences. If the coding 
sequence is intended for expression in a eukaryotic cell, a polyadenylation signal and 
transcription termination sequence will usually be located 3' to the coding sequence. 

An expression vector is a DNA molecule, linear or circular, that comprises a segment 
encoding a polypeptide of interest operably linked to additional segments that provide for its 
transcription. Such additional segments may include promoter and terminator sequences, and 
optionally one or more origins of replication, one or more selectable markers, an enhancer, a 
polyadenylation signal, and the like. Expression vectors are generally derived from plasmid or 
viral DNA, or may contain elements of both. 

Transcriptional and translational control sequences are DNA regulatory sequences, 
such as promoters, enhancers, terminators, and the like, that provide for the expression of a 
coding sequence in a host cell. In eukaryotic cells, polyadenylation signals are control 
sequences. 

A "secretory signal sequence" is a DNA sequence that encodes a polypeptide (a 
"secretory peptide" that, as a component of a larger polypeptide, directs the larger polypeptide 
through a secretory pathway of a cell in which it is synthesized. The larger polypeptide is 
commonly cleaved to remove the secretory peptide during transit through the secretory 
pathway. 

The term "promoter" is used herein for its art-recognized meaning to denote a portion of 
a gene containing DNA sequences that provide for the binding of RNA polymerase and 
initiation of transcription. Promoter sequences are commonly, but not always, found in the 5' 
.non-coding regions of genes. 

A chromosomal gene is rendered non-functional if the polypeptide that the gene 
•encodes can no longer be expressed in a functional form. Such non-functionality of a gene can 
be induced by a wide variety of genetic manipulations as known in the art, some of which are 
described in Sambrook et al. vide supra. Partial deletions within the ORF of a gene will often 
render the gene non-functional, as will mutations. 

The term "an expressible copy of a chromosomal gene" is used herein as meaning a 
copy of the ORF of a chromosomal gene, wherein the ORF can be expressed to produce a 
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• fully functional gene product. The expressible copy may not be transcribed from the native 
promoter of the chromosomal gene, it may instead be transcribed from a foreign or 
heterologous promoter, or it may indeed be promoterless and expressed only by transcriptional 
read-through from a gene present upstream of the 5' end of the ORF. Transcriptional read- 

5 through is intended to have the same meaning here as the generally recognized meaning in 
the art. 

"Operably linked", when referring to DNA segments, indicates that the segments are 
arranged so that they function In concert for their intended purposes, e.g. transcription initiates 
in the promoter and proceeds through the coding segment to the terminator. 
1 o A coding sequence is "under the control" of transcriptional and translations control 

sequences in a cell when RNA polymerase transcribes the coding sequence into mRNA. which 
is then trans-RNA spliced and translated into the protein encoded by the coding sequence. 

"Heterologous" DNA refers to DNA not naturally located in the cell, or in a chromosomal 
site of the cell. Preferably, the heterologous DNA includes a gene foreign to the cell. 
15 As used herein the term "nucleic acid construct' is intended to indicate any nucleic add 

molecule of cDNA, genomic DNA, synthetic DNA or RNA origin. The term "construct" is 
intended to indicate a nucleic acid segment which may be single- or double-stranded, and 
which may be based on a complete or partial naturally occurring nucleotide sequence 
encoding a polypeptide of interest. The construct may optionally contain other nucleic acid 
20 segments. 

The nucleic acid construct of the invention encoding the polypeptide of the invention 
may suitably be of genomic or cDNA origin, for instance obtained by preparing a genomic or 
cDNA library and screening for DNA sequences coding for all or part of the polypeptide by 
hybridization using synthetic oligonucleotide probes in accordance with standard techniques 

25 (cf. Sambrook et a»., supra). 

The nucleic acid construct of the invention encoding the polypeptide may also be 
prepared synthetically by established standard methods, e.g. the phosphoamidite method 
described by Beaucage and Caruthers, Tetrahedron Letters 22 (1981). 1859 - 1869, or the 
method described by Matthes et al., EMBO Journal 3 (1984), 801 - 805. According to the 
30 phosphoamidite method, oligonucleotides are synthesized, e.g. in an automatic DNA 
"synthesizer, purified, annealed, ligated and cloned in suitable vectors. 

Furthermore, the nucleic acid construct may be of mixed synthetic and genomic, mixed 
synthetic and cDNA or mixed genomic and cDNA origin prepared by ligating fragments of 
synthetic, genomic or cDNA origin (as appropriate), the fragments corresponding to various 
parts of the entire nucleic acid construct, in accordance with standard techniques. The nucleic 
acid construct may also be prepared by polymerase chain reaction using specific primers, for 
instance as described in US 4,683,202 or Saiki et al., Science 239 (1988), 487 - 491 . 
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The term nucleic acid construct may be synonymous with the term "expression 
cassette" when the nucleic acid construct contains the control sequences necessary for 
expression of a coding sequence of the present invention 

The term "control sequences" is defined herein to include all components which are 
necessary or advantageous for expression of the coding sequence of the nucleic acid 
sequence. Each control sequence may be native or foreign to the nucleic acid sequence 
encoding the polypeptide. Such control sequences include, but are not limited to, a leader, a 
polyadenylation sequence, a propeptide sequence, a promoter, a signal sequence, and a 
transcription terminator. At a minimum, the control sequences include a promoter, and 
transcriptional and translational stop signals. The control sequences may be provided with 
linkers for the purpose of introducing specific restriction sites facilitating ligation of the control 
sequences with the coding region of the nucleic acid sequence encoding a polypeptide. 

The control sequence may be an appropriate promoter sequence, a nucleic acid 
sequence which is recognized by a host cell for expression of the nucleic acid sequence. The 
promoter sequence contains transcription and translation control sequences which mediate the 
expression of the polypeptide. The promoter may be any nucleic acid sequence which shows 
transcriptional activity in the host cell of choice and may be obtained from genes encoding 
extracellular or intracellular polypeptides either homologous or heterologous to the host cell. 

The control sequence may also be a suitable transcription terminator sequence, a 
sequence recognized by a host cell to terminate transcription. The terminator sequence is 
operably linked to the 3* terminus of the nucleic acid sequence encoding the polypeptide. Any 
terminator which is functional in the host cell of choice may be used in the present invention. 

The control sequence may also be a polyadenylation sequence, a sequence which is 
operably linked to the 3* terminus of the nucleic acid sequence and which, when transcribed, is 
recognized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. 
Any polyadenylation sequence which is functional in the host cell of choice may be used in the 
present invention. 

The control sequence may also be a signal peptide coding region, which codes for an 

amino acid sequence linked to the amino terminus of the polypeptide which can direct the 

expressed polypeptide into the cell's secretory pathway of the host cell. The 5' end of the 

coding sequence of the nucleic acid sequence may inherently contain a signal peptide coding 

region naturally linked in translation reading frame with the segment of the coding region which 

•encodes the secreted polypeptide. Alternatively, the 5' end of the coding sequence may 

contain a signal peptide coding region which is foreign to that portion of the coding sequence 

which encodes the secreted polypeptide. A foreign signal peptide coding region may be 

required where the coding sequence does not normally contain a signal peptide coding region. 

Alternatively, the foreign signal peptide coding region may simply replace the natural signal 

peptide coding region in order to obtain enhanced secretion of the exoprotein relative to the 

7 
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natural signal peptide coding region normally associated with the coding sequence. The signal 
peptide coding region may be obtained from a glucoamylase or an amylase gene from an 
Aspergillus species, a lipase or proteinase gene from a Rhizomucor species, the gene for the 
alpha-factor from Saccharomyces cerevisiae, an amylase or a protease gene from a Bacillus 
5 species, or the calf preprochymosin gene. However, any signal peptide coding region capable 
of directing the expressed polypeptide into the secretory pathway of a host cell of choice may 
be used in the present invention. 

The control sequence may also be a propeptide coding region, which codes for an 
amino acid sequence positioned at the amino terminus of a polypeptide. The resultant 

10 polypeptide is known as a proenzyme or propolypeptide (or a zymogen in some cases). A 
propolypeptide is generally inactive and can be converted to mature active polypeptide by 
catalytic or autocatalytic cleavage of the propeptide from the propolypeptide. The propeptide 
coding region may be obtained from the Bacillus subtilis alkaline protease gene (aprE), the 
Bacillus subtilis neutral protease gene (nprT), the Saccharomyces cerevisiae alpha-factor 

15 gene, or the Myceliophthora thermophilum laccase gene (WO 95/33836). 

It may also be desirable to add regulatory sequences which allow the regulation of the 
expression of the polypeptide relative to the growth of the host cell. Examples of regulatory 
systems are those which cause the expression of the gene to be turned on or off in response 
to a chemical or physical stimulus, including the presence of a regulatory compound. 

20 Regulatory systems in prokaryotic systems would include the lac, tac, and trp operator 
systems. In yeast, the ADH2 system or GAL1 system may be used. Other examples of 
regulatory sequences are those which allow for gene amplification. In eukaryotic systems, 
these include the dihydrofolate reductase gene which is amplified in the presence of 
methotrexate, and the metallothionein genes which are amplified with heavy metals. In these 

25 cases, the nucleic acid sequence encoding the polypeptide would be placed in tandem with the 
regulatory sequence. 

Examples of suitable promoters for directing the transcription of the gene(s) of the 
present invention, especially in a bacterial host cell, are the promoters obtained from the E co// 
lac operon, the Streptomyces coelicolor agarase gene (dagA), the Bacillus subtilis 

30 levansucrase gene (sacB), the Bacillus subtilis alkaline protease gene, the Bacillus 

Mcheniformis alpha-amylase gene (amyL), the Bacillus stearothermophilus maltogenic amylase 
gene (amyM), the Bacillus amylofiquefaciens alpha-amylase gene (amyQ). the Bacillus 
•amyloliquefaciens BAN amylase gene, the Bacillus licheniformis penicillinase gene (penP), the 
Bacillus subtilis xylA and xylB genes, and the prokaryotic beta-lactamase gene (Villa-Kamaroff 

35 et al. s 1978, Proceedings of the National Academy of Sciences USA 75:3727-3731 ), as well as 
the tac promoter (DeBoer et al., 1983, Proceedings of the National Academy of Sciences USA 
80:21-25). Further promoters are described in "Useful proteins from recombinant bacteria" in 
Scientific American, 1980, 242:74-94; and in Sambrook et al., 1989, supra. 
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An effective signal peptide coding region for bacterial host cells is the signal peptide 
coding region obtained from the maltogenic amylase gene from Bacillus NCIB 1 1837, the 
Bacillus stearothermophifus alpha-amylase gene, the Bacillus licheniformis subtilisin gene, the 
Bacillus licheniformis beta-lactamase gene, the Bacillus stearothermophilus neutral proteases 
5 genes (nprT, nprS, nprM), and the Bacillus subtilis PrsA gene. Further signal peptides are 
described by Simonen and Palva, 1993, Microbiological Reviews 57:109-137. 

The present invention also relates to recombinant expression vectors comprising a 
nucleic acid sequence of the present invention, a promoter, and transcriptional and 
translational stop signals. The various nucleic acid and control sequences described above 

10 may be joined together to produce a recombinant expression vector which may include one or 
more convenient restriction sites to allow for insertion or substitution of the nucleic acid 
sequence encoding the polypeptide at such sites. Alternatively, the nucleic acid sequence of 
the present invention may be expressed by inserting the nucleic acid sequence or a nucleic 
acid construct comprising the sequence into an appropriate vector for expression. In creating 

15 the expression vector, the coding sequence is located in the vector so that the coding 
sequence is operably linked with the appropriate control sequences for expression, and 
possibly secretion. 

The recombinant expression vector may be any vector (e.g., a plasmid or virus) which 
can be conveniently subjected to recombinant DNA procedures and can bring about the 

20 expression of the nucleic acid sequence. The choice of the vector will typically depend on the 
compatibility of the vector with the host cell into which the vector is to be introduced. The 
vectors may be linear or closed circular plasmids. The vector may be an autonomously 
replicating vector, i.e., a vector which exists as an extrachromosomal entity, the replication of 
which is independent of chromosomal replication, e.g., a plasmid, an extrachromosomal 

25 element, a minichromosome, or an artificial chromosome. The vector may contain any means 
for assuring self-replication. Alternatively, the vector may be one which, when introduced into 
the host cell, is integrated into the genome and replicated together with the chromosome(s) 
into which it has been integrated. The vector system may be a single vector or plasmid or two 
or more vectors or plasmids which together contain the total DNA to be introduced into the 

30 genome of the host cell, or a transposon. 

The vectors of the present invention preferably contain one or more selectable markers 
which permit easy selection of transformed cells. A selectable marker is a gene the product of 
which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to 
auxotrophs, and the like, 

35 Antibiotic selectable markers confer antibiotic resistance to such antibiotics as 

ampicillin, kanamycin, chloramphenicol, tetracycline, neomycin, hygromycin or methotrexate. 
Suitable markers for yeast host cells are ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and URA3. 
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The vectors of the present invention preferably contain an element(s) that permits 
stable integration of the vector, or of a smaller part of the vector, into the host cell genome or 
autonomous replication of the vector in the cell independent of the genome of the cell. 

The vectors, or smaller parts of the vectors such as amplification units of the present 

5 invention, may be integrated into the host cell genome when introduced into a host cell. For 
chromosomal integration, the vector may rely on the nucleic acid sequence encoding the 
polypeptide or any other element of the vector for stable integration of the vector into the 
genome by homologous or nonhomologous recombination. 

Alternatively, the vector may contain additional nucleic acid sequences for directing 

10 integration by homologous recombination into the genome of the host cell. The additional 
nucleic acid sequences enable the vector to be integrated into the host cell genome at a 
precise location(s) in the chromosome(s). To increase the likelihood of integration at a precise 
location, the integrational elements should preferably contain a sufficient number of nucleic 
acids, such as 100 to 1,500 base pairs, preferably 400 to 1,500 base pairs, and most 

15 preferably 800 to 1 ,500 base pairs, which are highly homologous with the corresponding target 
sequence to enhance the probability of homologous recombination. The integrational 
elements may be any sequence that is homologous with the target sequence in the genome of 
the host cell. Furthermore, the integrational elements may be non-encoding or encoding 
nucleic acid sequences; specific examples of encoding sequences suitable for site-specific 

20 integration by homologous recombination are given in WO 02/00907 (Novozymes, Denmark), 
which is hereby incorporated by reference in its totality. 

On the other hand, the vector may be Integrated into the genome of the host cell by 
non-homologous recombination. These nucleic acid sequences may be any sequence that is 
homologous with a target sequence in the genome of the host cell, and, furthermore, may be 

25 non-encoding or encoding sequences. The copy number of a vector, an expression cassette, 
an amplification unit, a gene or indeed any defined nucleotide sequence is the number of 
identical copies that are present in a host cell at any time. A gene or another defined 
chromosomal nucleotide sequence may be present in one, two, or more copies on the 
chromosome. An autonomously replicating vector may be present in one, or several hundred 

30 copies per host cell. 

An amplification unit of the invention is a nucleotide sequence that can integrate into 
the chromosome of a host cell, whereupon it can increase in number of chromosomally 
• integrated copies by duplication of multiplication. The unit comprises an expression cassette 
as defined herein comprising at least one copy of a gene of interest and an expressable copy 

35 of a chromosomal gene, as defined herein, of the host cell. When the amplification unit is 
integrated into the chromosome of a host cell, it is defined as that particular region of the 
chromosome which is prone to being duplicated by homologous recombination between two 
directly repeated regions of DNA. The precise border of the amplification unit with respect to 
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the flanking DNA is thus defined functionally, since the duplication process may indeed 
duplicate parts of the DNA which was introduced into the chromosome as well as parts of the 
endogenous chromosome itself, depending on the exact site of recombination within the 
repeated regions. This principle is illustrated in Janntere etal. (1985, Stable gene amplification 
5 in the chromosome of Bacillus subtilis. Gene, 40: 47-55), which is incorporated herein by 
reference. 

For autonomous replication, the vector may further comprise an origin of replication 
enabling the vector to replicate autonomously in the host cell in question. Examples of 
bacterial origins of replication are the origins of replication of plasmids pBR322, pU.C19, 

10 pACYC177, pACYC184, pUB110, pE194, pTA1060, and pAMbetal. Examples of origin of 
replications for use in a yeast host cell are the 2 micron origin of replication, the combination of 
CEN6 and ARS4, and the combination of CEN3 and ARS1. The origin of replication may be 
one having a mutation which makes its functioning temperature-sensitive in the host cell (see, 
e.g., Ehriich, 1978, Proceedings of the National Academy of Sciences USA 75:1433). 

15 The present invention also relates to recombinant host cells, comprising a nucleic acid 

sequence of the invention, which are advantageously used in the recombinant production of 
the polypeptides. The term "host cell" encompasses any progeny of a parent cell which is not 
identical to the parent cell due to mutations that occur during replication. 

The cell is preferably transformed with a vector comprising a nucleic acid sequence of 

20 the invention followed by integration of the vector into the host chromosome. "Transformation" 
means introducing a vector comprising a nucleic acid sequence of the present invention into a 
host cell so that the vector is maintained as a chromosomal integrant or as a self-replicating 
extra-chromosomal vector. Integration is generally considered to be an advantage as the 
nucleic acid sequence is more likely to be stably maintained in the cell. Integration of the 

25 vector into the host chromosome may occur by homologous or non-homologous recombination 
as described above. 

The transformation of a bacterial host cell may, for instance, be effected by protoplast 
transformation (see, e.g., Chang and Cohen, 1979, Molecular General Genetics 168:1 1 1-115), 
by using competent cells (see, e.g., Young and Spizizin, 1961 , Journal of Bacteriology 81:823- 
30 829, or Dubnar and Davfdoff-Abelson, 1971, Journal of Molecular Biology 56:209-221), by 
.electroporation (see, e.g., Shigekawa and Dower, 1988, Biotechniques 6:742-751), or by 
conjugation (see, e.g., Koehler and Thorne, 1987, Journal of Bacteriology 169:5771-5278). 

The transformed or transfected host cells described above are cultured in a suitable 
nutrient medium under conditions permitting the expression of the desired polypeptide, after 
35 which the resulting polypeptide is recovered from the cells, or the culture broth. 

The medium used to culture the cells may be any conventional medium suitable for 
growing the host cells, such as minimal or complex media containing appropriate supplements. 
Suitable media are available from commercial suppliers or may be prepared according to 
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published recipes (e.g. in catalogues of the American Type Culture Collection). The media are 
prepared using procedures known in the art (see, e.g., references for bacteria and yeast; 
Bennett, J.W. and LaSure, L, editors, More Gene Manipulations in Fungi, Academic Press, 
CA, 1991). 

5 If the polypeptide is secreted into the nutrient medium, the polypeptide can be 

recovered directly from the medium. If the polypeptide is not secreted, it is recovered from cell 
lysates. The polypeptide are recovered from the culture medium by conventional procedures 
including separating the host cells from the medium by centrifugation or filtration, precipitating 
the proteinaceous components of the supernatant or filtrate by means of a salt, e.g. 

10 ammonium sulphate, purification by a variety of chromatographic procedures, e.g. ion 

exchange chromatography, gelfiltration chromatography, affinity chromatography, or the like, 
dependent on the type of polypeptide in question. 

The polypeptides may be detected using methods known in the art that are specific for 
the polypeptides. These detection methods may include use of specific antibodies, formation 

15 of an enzyme product, or disappearance of an enzyme substrate. For example, an enzyme 
assay may be used to determine the activity of the polypeptide. 

The polypeptides of the present invention may be purified by a variety of procedures 
known in the art including, but not limited to, chromatography (e.g., ion exchange, affinity, 
hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e.g., 

20 preparative isoelectric focusing (iEF), differential solubility (e.g., ammonium sulfate 

precipitation), or extraction (see, e.g., Protein Purification, J.-C. Janson and Lars Ryden, 
editors, VCH Publishers, New York, 1989). 

In the present context, the term "substantially pure polypeptide" means a polypeptide 
preparation which contains at the most 10% by weight of other polypeptide material with which 

25 it is natively associated (lower percentages of other polypeptide material are preferred, e.g. at 
the most 8% by weight, at the most 6% by weight, at the most 5% by weight, at the most 4% at 
the most 3% by weight, at the most 2% by weight, at the most 1% by weight, and at the most 
!4% by weight). Thus, it is preferred that the substantially pure polypeptide is at least 92% 
pure, i.e. that the polypeptide constitutes at least 92% by weight of the total polypeptide 

30 material present in the preparation, and higher percentages are preferred such as at least 94% 
-pure, at least 95% pure, at least 96% pure, at least 96% pure, at least 97% pure, at least 98% 
pure, at least 99%, and at the most 99,5% pure. The polypeptides disclosed herein are 
•preferably in a substantially pure form. In particular, it is preferred that the polypeptides 
disclosed herein are in "essentially pure form", i.e. that the polypeptide preparation is 

35 essentially free of other polypeptide material with which it is natively associated. This can be 

accomplished, for example, by preparing the polypeptide by means of well-known recombinant 

methods. Herein, the term "substantially pure polypeptide" is synonymous with the terms 

"isolated polypeptide" and "polypeptide in isolated form". 

12 



In the present context, the homology between two amino acid sequences or 
between two nucleotide sequences is described by the parameter "identity". For 
purposes of the present invention, alignments of sequences and calculation of homology 
scores may be done using a full Smith-Waterman alignment, useful for both protein and DNA 
alignments. The default scoring matrices BLOSUM50 and the identity matrix are used for 
protein and DNA alignments respectively. The penalty for the first residue in a gap is -12 for 
proteins and -16 for DNA, while the penalty for additional residues in a gap is -2 for proteins 
and -4 for DNA. Alignment may be made with the FASTA package version v20u6 (W. R. 
Pearson and D. J. Lipman (1988), "Improved Tools for Biological Sequence Analysis", PNAS 
85:2444-2448, and W. R. Pearson (1990) "Rapid and Sensitive Sequence Comparison with 
FASTP and FASTA", Methods in Enzymology, 183:63-98). 

Multiple alignments of protein sequences may be made using "ClustalW" (Thompson, 
J.D., Higgins, D.G. and Gibson, TJ. (1994) CLUSTAL W: improving the sensitivity of 
progressive multiple sequence alignment through sequence weighting, positions-specific gap 
penalties and weight matrix choice. Nucleic Acids Research, 22:4673-4680). Multiple 
alignment of DNA sequences may be done using the protein alignment as a template, 
replacing the amino acids with the corresponding codon from the DNA sequence. 

In the present context, a functional homologue of the MrgA protein is a protein, which 
when expressed at a higher level in a cell, leads to an increased secretion of an exoenzyme, 
such as an alpha-amylase, when compared with a cell with normal expression of the MrgA 
functional homologue cultivated under essentially identical conditions. In addition, the 
functional homologue of the MrgA protein shares an amino acid sequence identity with the 
MrgA protein of at least 50%, preferably 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 
97%, or most preferably 99% when aligned as described above. 

In the present context, the term "allelic variant" denotes any of two or more alternative 
forms of a gene occupying the same chromosomal locus. Allelic variation arises naturally 
through mutation, and may result in polymorphism within populations. Gene mutations can be 
silent (no change in the encoded polypeptide) or may encode polypeptides having altered 
amino acid sequences. An allelic variant of a polypeptide is a polypeptide encoded by an 
allelic variant of a gene. Allelic variants are included in the present definition of functional 
homologues. 

The MrgA protein or functional homologue thereof may be a wild-type protein identified 
and isolated from a natural source. Such wild-type proteins may be specifically screened for by 
standard techniques known in the art. Furthermore, the MrgA protein or functional homologue 
thereof may be prepared by the DNA shuffling technique, such as described in J.E. Ness et al. 
Nature Biotechnology 17, 893-896 (1999). Moreover, the MrgA protein or functional 
homologue thereof may be an artificial variant. Such artificial variants may be constructed by 

13 
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standard techniques known in the art, such as by site-directed/random mutagenesis. In one 
embodiment of the invention, amino acid changes (in the artificial variant as well as in wild-type 
polypeptides) are of a minor nature, that is conservative amino acid substitutions that do not 
significantly affect the folding and/or activity of the protein; small deletions, typically of one to 
5 about 30 amino acids; small amino- or carboxyl-terminal extensions, such as an amino- 
terminal methionine residue; a small linker peptide of up to about 20-25 residues; or a small 
extension that facilitates purification by changing net charge or another function, such as a 
poly-histidine tract, an antigenic epitope or a binding domain. 

Examples of conservative substitutions are within the group of basic amino acids 

10 (arginine, lysine and histidine), acidic amino acids (glutamic acid and aspartic acid), polar 

amino acids (glutamine and asparagine), hydrophobic amino acids (leucine, isoleucine, valine 
and methionine), aromatic amino acids (phenylalanine, tryptophan and tyrosine), and small 
amino acids (glycine, alanine, serine and threonine). Amino acid substitutions which do not 
generally alter the specific activity are known in the art and are described, for example, by H. 

15 Neurath and R.L Hill, 1979, In, The Proteins. Academic Press, New York. The most commonly 
occurring exchanges are Ala/Ser, Val/lle f Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr, Ser/Asn, Ala/Val, 
Ser/Gly, Tyr/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu/lle, LeuA/al, Ala/Glu, and Asp/Gly as well as 
these in reverse. 

It will be apparent to those skilled in the art that such modifications can be made 
20 outside the regions critical to the function of the molecule and still result in an active 

polypeptide. Amino acid residues essential to the activity of the polypeptide encoded by the 
nucleotide sequence of the invention, and therefore preferably not subject to modification, such 
as substitution, may be identified according to procedures known in the art, such as site- 
directed mutagenesis or alanine-scanning mutagenesis (see, e.g., Cunningham and Wells, 
25 1989, Science 244: 1081-1085). In the latter technique, mutations are introduced at every 
positively charged residue In the molecule, and the resultant mutant molecules are tested for 
activity to identify amino acid residues that are critical to the activity of the molecule. Sites of 
substrate-enzyme interaction can also be determined by analysis of the three-dimensional 
structure as determined by such techniques as nuclear magnetic resonance analysis, 
30 crystallography or photoaffinity labelling (see, e.g., de Vos et al., 1992, Science 255: 306-312; 
.Smith et al., 1992, Journal of Molecular Biology 224: 899-904; Wlodaver et al., 1992, FEBS 
Letters 309: 59-64). 

Moreover, a nucleotide sequence encoding a polypeptide of the present invention may 
be modified by introduction of nucleotide substitutions which do not give rise to another amino 
35 acid sequence of the polypeptide encoded by the nucleotide sequence, but which correspond 
to the codon usage of the host organism intended for production of the enzyme. 

The introduction of a mutation into the nucleotide sequence to exchange one 
nucleotide for another nucleotide may be accomplished by site-directed mutagenesis using 



any of the methods known in the art. Particularly useful is the procedure, which utilizes a 
supercoiled, double stranded DNA vector with an insert of interest and two synthetic primers 
containing the desired mutation. The oligonucleotide primers, each complementary to opposite 
strands of the vector, extend during temperature cycling by means of Pfu DNA polymerase. 
On incorporation of the primers, a mutated plasmid containing staggered nicks is generated. 
Following temperature cycling, the product is treated with Dpnl which is specific for methylated 
and hemimethylated DNA to digest the parental DNA template and to select for mutation- 
containing synthesized DNA. Other procedures known in the art may also be used. For a 
general description of nucleotide substitution, see, e.g., Ford et al., 19gi, Protein Expression 
and Purification 2: 95-107. 

DETAILED DESCRIPTION OF THE INVENTION 

The first aspect of the invention relates to a progeny cell derived from a parent cell, 
wherein 

a) the progeny cell comprises at least one gene encoding MrgA protein or a functional 
homologue thereof and/or a DNA segment operably linked with the encoding gene, 
wherein said gene and/or DNA segment is manipulated with respect to the parent cell; 

b) the progeny cell comprises two or more copies of a gene encoding MrgA protein or a 
functional homologue thereof; or 

c) the progeny cell is mutated with respect to the parent cell; 

whereby the progeny cell produces greater amounts of MrgA protein or a functional homologue 
thereof than the parent celL 

The cell of the invention greater amounts of MrgA protein or a funtional homologue 
thereof, than the parent cell. A comparison should be made by cultivating the cell of the 
invention as well as the parent cell under essentially identical conditions, and comparing the 
amount of MrgA protein by any standard method in the art. Preferably the cell of the invention 
produces at least 5% more MrgA than the parent, more preferably at least 10%, still more 
preferably at least 20%, and most preferably at least 50% more MrgA protein or a functional 
homologue thereof than the parent Such overproduction may be accomplished by standard 
means known to the art, e.g., use of multicopy plasmids, multiple copies of the genes encoding 

•MrgA or a functional homologue thereof, and/or the exoprotein of interest, in the chromosome 
of the host, combined with altering the regulatory elements to increase expression, e.g., use of 

• strong promoter(s), use of multiple promoters, use of enhancers, and so forth. 

As the inventors show herein, a cell of the first aspect is capable of secreting greater 

amounts of an exoprotein of interest than the corresponding parent cell, when both are 

cultivated under essentially identical conditions. 

Accordingly, a preferred embodiment of the invention relates to the cell of the first 

aspect, which secretes greater amounts of an exoprotein of interest than the parent cell. 

15 



Preferably the cell of the invention secretes at least 5% more exoprotein than the parent, more 
preferably at least 10% more, still more preferably at least 20% more, and most preferably at 
least 50% more exoprotein than the parent. The amount of secreted exoprotein from each cell 
may be determined by any suitable assay in the art; a non-limiting example is shown below 
with the determination of secreted amounts of alpha-amylase. 

The choice of a host cell will to a large extent depend upon the gene encoding the 
polypeptide and its source. The host cell may be a unicellular microorganism, e.g., a 
prokaryote, or a non-unicellular microorganism, e.g., a eukaryote. Useful unicellular cells are 
bacterial cells such as gram positive bacteria including, but not limited to, a Bacillus cell, e.g., 
Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus 
coagulans, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus 
stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis; or a Streptomyces cell, e.g., 
Streptomyces lividans or Streptomyces murinus, or gram negative bacteria such as E. coli and 
Pseudomonas sp. In a preferred embodiment, the bacterial host cell is a Bacillus lentus, 
Bacillus licheniformis, Bacillus stearothermophitus or Bacillus subtilis cell. In one preferred 
embodiment, the bacterial host cell is a prokaryotic cell, preferably a Gram-positive prokaryotic 
cell, and more preferably the bacterial Gram positive cell is a species of the genus Bacillus, 
preferably selected from the group consisting of Bacillus alkalophilus, Bacillus 
amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus coagulans, Bacillus lautus, 
Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus stearothermophilus, 
Bacillus subtilis, and Bacillus thuringiensis. 

A preferred embodiment relates to the cell of the invention, which is a bacterial cell, 
preferably a prokaryotic cell, more preferably a Gram-positive cell, and most preferably of the 
genus Bacillus', still more preferably it is of a species chosen from the group consisting of 
Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus 
coagulans, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus stearothermophilus, 
Bacillus subtilis, and Bacillus thuringiensis. 

The exoprotein of interest may be endogenous or exogenous to the host cell, it may be 
a homologous exoprotein, or a heterologous exoprotein. 

A preferred embodiment relates to the ceil, wherein said exoprotein is a protease, a 
lipase, a cutinase, an amylase, a galactosidase, a pullulanase, a cellulase, a glucose 
isomerase, a protein disulphide isomerase, a CGTase (cyclodextrin gluconotransferase), a 
phytase, a glucose oxidase, a glucosyl transferase, lactase, bilirubin oxidase, a xylanase, an 
antigenic microbial or protozoan protein, a bacterial protein toxin, a microbial surface protein, 
or a viral protein. 

An evolutionary homologue of the MrgA protein, an allellic variant, an artificial variant, a 
shuffled variant, a species variant, and so forth, are all referred to as a "functional homologue" 
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or the MrgA protein in the present description, and the inventors envision that such functional 
homologue protein(s) will be equally effective in the cell of the Invention. 

Specifically, a preferred embodiment relates to the cell, wherein the MrgA protein or 
functional homologue thereof comprises an amino acid sequence which is at least 70% 
identical to the amino sequence shown in SEQ ID NO:2, preferably at least 75%, 80%, 85%, 
90%, 95%, 97%, or even 99% identical to the amino sequence shown in SEQ ID NO:2. 

Another preferred embodiment relates to the cell of the invention, wherein the MrgA 
protein or functional homologue thereof comprises or consists of the amino acid sequence 
shown in SEQ ID NO:2. 

Still another preferred embodiment relates to the cell of the invention, which comprises 
at least one exogenous copy of a polynucleotide encoding MrgA protein or a functional 
homologue thereof comprising an amino acid sequence which is at least 70% identical to the 
amino sequence shown in SEQ ID NO:2, preferably at least 75%, 80%, 85%, 90%, 95%, 97%, 
or even 99% identical to the amino sequence shown in SEQ ID NO:2. 

In a preferred embodiment the cell of the invention comprises at least one exogenous 
copy of a polynucleotide encoding MrgA protein or a functional homologue thereof comprising 
or consisting of the amino acid sequence shown in SEQ ID NO:2. 

A preferred cell comprises at least one exogenous copy of a polynucleotide, which: 

a) comprises a polynucleotide sequence which Is at least 70% identical to the sequence 
shown in SEQ ID NO:1; preferably at least 75%, 80%, 85%, 90%, 95%, 97%, or even 
99% identical to the sequence shown in SEQ ID NO:1; or 

b) hybridizes with the sequence shown in SEQ ID NO:1 , under medium stringency 
conditions, preferably under medium-high stringency, and more preferably under high 
stringency conditions. 

As described above, and exemplified herein, one preferred embodiment relates to a 
cell, wherein at least one exogenous copy of a gene encoding the MrgA protein or a functional 
homologue thereof is transcribed from one or more heterologous and/or artificial promoter. 

In a preferred cell, at least one exogenous copy of a gene encoding the MrgA protein or 
a functional homologue thereof is integrated into the genome of the cell; or is present on an 
extra-chromosomal construct, preferably a plasmid. 

Another aspect of the invention relates to a method for enhancing secretion of an 
exoprotein of interest, the method comprising expressing said exoprotein in a cell according to 
the first aspect. 

Still another aspect of the invention relates to a method for producing a cell as defined 
in the first aspect, useful for production of an exoprotein of interest, said method comprising a 
step of manipulating a cell to increase the expression of MrgA protein or functional homologue 
thereof. 
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In the production methods of the present invention, the cells are cultivated in a nutrient 
medium suitable for production of the polypeptide using methods known in the art. For 
example, the cell may be cultivated by shake flask cultivation, small-scale or large-scale 
fermentation (including continuous, batch, fed-batch, or solid state fermentations) in laboratory 
5 or industrial fermentors performed in a suitable medium and under conditions allowing the 
polypeptide to be expressed and/or isolated. The cultivation takes place in a suitable nutrient 
medium comprising carbon and nitrogen sources and inorganic salts, using procedures known 
in the art. Suitable media are available from commercial suppliers or may be prepared 
according to published compositions (e.g., in catalogues of the American Type Culture 
10 Collection). If the polypeptide is secreted into the nutrient medium, the polypeptide can be 
recovered directly from the medium. If the polypeptide is not secreted, it can be recovered 
from cell lysates. 

The polypeptides may be detected using methods known in the art that are specific for 

15 the polypeptides. These detection methods may include use of specific antibodies, formation 

of an enzyme product, or disappearance of an enzyme substrate. For example, an enzyme 

assay may be used to determine the activity of the polypeptide as described herein. 

« 

The polypeptides may be purified by a variety of procedures known in the art including, 
but not limited to, chromatography (e.g M ion exchange, affinity, hydrophobic, 
20 chromatofocusing, and size exclusion), electrophoretic procedures (e.g., preparative isoelectric 
focusing), differential solubility (e.g., ammonium sulfate precipitation), SDS-PAGE, 
centrifugation, filtration, extraction, spray-drying, evaporation, precipitation, or extraction (see, 
e.g., Protein Purification, J.-C. Janson and Lars Ryden, editors, VCH Publishers, New York, 
1989). 

25 In a preferred embodiment of the method of the third aspect, the manipulated cell 

secretes greater amounts of an exoprotein of interest than the non-manipulated parent cell. 

Another preferred embodiment relates to the method of the third aspect, wherein said 
method comprises the steps of: 

a) identifying a gene from the parent cell that encodes MrgA protein or a functional 
30 homologue thereof; and 

. b) manipulating the cell to increase the expression of the gene identified in step (a), whereby 
the manipulated progeny cell expresses greater amounts of MrgA protein or functional 
homologue thereof, than the non-manipulated cell. 

35 

EXAMPLES 
Materials and Methods 

Strains 
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B.subtilis 168: F. Kunst ef. a/. "The complete genome sequence of the Gram-positive 
bacterium Bacillus subtilis". Nature (1997) 390:249-256. 

B.subtilis AN53: B.subtitis 168 with plasmid pKTHIO and P920mrgA integrated in the amyE 
locus (this study). 

5 B.subtilis AN55: B.subtitis 168 with plasmid pKTHIO and P740mrgA integrated in the 
amyE locus (this study). 

B.subtilis AN57: B.subtilis 168 with plasmid pKTHIO and P726mrgA integrated in the amyE 
locus (this study). 

B.subtilis AN83: B.subtilis 168 with plasmid pKTHIO (this study). 

10 

Competent cells were prepared and transformed as described by Yasbin, R.E., Wilson, 
G.A. and Young, F.E. (1975) Transformation and transfection in lysogenic strains of Bacillus 
subtilis : evidence for selective induction of prophage in competent cells. J. Bacteriol, 121:296- 
304. 

15 

Plasmids 

PKTH10: Vehmaanpera J, Steinborn G, Hofemeister J.:" Genetic manipulation of Bacillus 
amyloliquefaciens." J Biotechnol. 1991. 19(2-3):221-40. This plasmid constitutively express 
the Bacillus amyloliquefaciens alpha-amylase (AmyQ). 

20 pDG268neo: This plasmid is a pDG268 derivative, which is unable to replicate in Bacillus 

subtilis (Antoniewski, C, Savelli, B., and Stragier, P., 1990, J. Bact 172). The plasmid contains 
the chloramphenicol (cam) resistance marker next to Sfi\ and BamHl restriction enzyme 
recoqnition sequences, flanked by the "5 in and m 3 n portions of the amyE locus of Bacillus 
subtilis. This plasmid is used for introduction of the MrgA expression cassette and the cam 

25 marker into the amyE locus of B. subtilis via double homologous recombination cross-over. 
The sequence of pDG268neo is shown in SEQ ID NO: 3. 

p920mroA: The mrgA gene was amplified by PCR from the chromosome of B. subtilis by 
use of primers p920mrgaF2 (SEQ ID NO: 4) and MBmrgaR2 (SEQ ID NO: 5). The upstream 
primer (p920mrgaF2; SEQ ID NO: 4) comprised a synthetic constitutive promoter, P920 (SEQ 
30 ID NO: 6). The PCR product shown in SEQ ID NO: 7 was cut with restriction enzymes Sffl and 
BamHl and ligated to the large Sfil-BamH\ fragment of pDG268neo, resulting in plasmid 
p920mrgA. 

p740mroA: The mrgA gene was amplified by PCR from the chromosome of B. subtilis by 
use of primers p740mrgaF2 (SEQ ID NO: 8) and MBmrgaR2 (SEQ ID NO: 5). The upstream 
35 primer (p740mrgaF2; SEQ ID NO: 8) comprised a synthetic constitutive promoter, P740 (SEQ 
ID NO: 9). The PCR product shown in SEQ ID NO: 10 was cut with restriction enzymes Sffl 
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and BamHl and ligated to the large Sfil-BamHl fragment of pDG268neo, resulting fn plasmfd 
p740mrgA. 

p726mroA: The mrgA was amplified by PCR from the chromosome of B. subtilis by use of 
primers p726mrgaF2 (SEQ ID NO: 11) and MBmrgaR2 (SEQ ID NO: 5). The upstream primer 
5 (p726mrgaF2; SEQ ID NO: 1 1 ) comprised the constitutive synthetic promoter P726 (SEQ ID 
NO: 12). The PCR product shown in SEQ ID NO: 13 was cut with restriction enzymes Sffl and 
BamHl and ligated to the large Sffl-BamH\ fragment of pDG268neo, resulting in plasmid 
p726mrgA. 

10 Primers: 

P920mrgaF (SEQ ID NO: 4): ctgaggccaattaggccaagtttattcttgacattagggaacatgcatgatat 

aataggtaaagtaaacagatcacaaggaggacgttatc 

P740mrgaF (SEQ ID NO: 8): ctgaggccaattaggcccggaagtttgttgacacagctccaggatacaaat 

ataatgggtcgactaaacagatcacaaggaggacgttatc 
15 P726mrgaF (SEQ ID NO: 11): ctgaggccaattaggccgaggtgagatttgacactagtaggctacgggac 

tataatgcgggaagtaaacagatcacaaggaggacgttatc 

MBmrgaR2 (SEQ ID NO: 5): tgaaggatccacgcgtccagcagacagaaagcag 

General molecular biology methods 
20 Unless otherwise mentioned the DNA manipulations and transformations were 

performed using standard methods of molecular biology (Sambrook et al. (1989) Molecular 

cloning: A laboratory manual, Cold Spring Harbor lab., Cold Spring Harbor, NY; Ausubel, F. M. 

et al. (eds.) "Current protocols in Molecular Biology". John Wiley and Sons, 1995; Harwood, C. 

R.. and Cutting, S. M. (eds.) "Molecular Biological Methods for Bacillus". John Wiley and Sons, 
25 1990). 

Enzymes for DNA manipulations were used according to the specifications of the 
suppliers (e.g. restriction endonucleases, ligases etc. are obtainable from New England 
Biolabs, Inc.). 



30 * Media 

LB agar (as described in Ausubel, F. M. et al. (eds.) "Current protocols in Molecular Biology". 
* John Wiley and Sons, 1995). 
LBP is LB agar supplemented with 0.05 M potassium phosphate, pH 7.0 
LBPG is LB agar supplemented with 0.5% Glucose and 0.05 M potassium phosphate, pH 7.0. 
35 LBPSK is LB agar supplemented with 0.05 M potassium phosphate, pH 7.0 and 1% of 
skimmed milk. 

20 
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BPX media is described in EP 0 506 780 (WO 91/09129). 
Fermentations 

Fermentations to evaluate the amylase yields were performed in shakeflasks with 100 ml 
5 BPX at 37°C, 300 rpm for seven days. Culture volumes of 10 ml were harvested and centrifuged 
at 10.000 g to remove cells and debris. The clear supemants were used for assaying alpha- 
amylase activity. 

Assay for alpha-amvlase activity 

10 Alpha-amylase activity was determined by a method employing an enzymatic 

colorimetric test with 4 l 6-ethylidene(G 7 )-p-nitrophenyI(G 1 )-alpha,D-maltoheptaoside 
(ethylidene-G 7 PNP) as substrate (Boehringer Mannheim, Germany art. 1442309). Under a 
specified set of conditions (temp., pH, reaction time, buffer conditions) 1 mg of a given a- 
amylase will hydrolyse a certain amount of substrate and a yellow colour will be produced. The 

15 colour intensity is measured at 405 nm. The measured absorbance is directly proportional to 
the activity of the alpha-amylase in question under a given set of conditions. 

EXAMPLE 1 

The mrgA gene was amplified by PGR from the chromosome of B. subtilis by use of 

20 primers p920mrgaF2 (SEQ ID NO: 4) and MBmrgaR2 (SEQ ID NO: 5). The upstream primer 
(p920mrgaF2) comprised the constitutive synthetic promoter P920 (SEQ ID NO:6). The PCR 
product (SEQ ID NO: 7) was cut with restriction enzymes Sffl and BamHI and ligated to the 
large Sffl-SamHI fragment of pDG268neo, resulting in plasmid p920mrgA. The ligation-mixture 
of p920mrgA, described in the Material and Methods section, was introduced by transformation 

25 into the B. subtilis 168 strain, and the transformants were plated on LBPSK media 

supplemented with 6 microg/ml chloramphenicol to select for integrants. Transformants 
growing on these plates have integrated the plasmid in the amy locus, either by a single (Cam* 
kan*) or double cross-over event (cam* kan*). Transformants were re-streaked on LBPSK/cam 
media with and without 20 microg/ml kanamycine. Strains where double cross-over events had 

30 occurred were cam* kan". These strains no longer showed the tell-tale clearing zones; which 
-signified that that integration in, and disruption of, the amy gene, had taken place. The site of 
integration and the mrgA sequence was verified by PCR and sequence analysis. The strain 
was transformed with plasmid pKTH10 which constitutively expresses the alpha-amylase 
AmyQ of Bacillus amyloliquefaciens. The resulting strain was named AN53. Yields of amylase 

35 from AN53 were determined in triplicate from three independent isolates, and compared to the 
yield of amylase from the control strain AN83. Results are shown in table 1; the AN53 strain 
constitutively expressing mrgA from the synthetic promoter has an increased alpha-amylase 
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yield, which on average is 13% higher than the control strain, AN83, which only comprises a 
wild-type copy of the mrgA gene. 

Table 1, Yields of amylase from AN53 were determined in triplicate from 
5 three independent isolates, and compared to the yield of amylase from the 
control strain AN83. The average yields of each strain are also shown. Nd: 



Not determined. 



Strain 


Amylase activity 
(relative) 


average 


average 
(in %) 


AN53-1.1 


15,7 






AN53-1.2 


15,8 






AN53-1 .3 


14,9 






AN53-2.1 


12,5 






AN53-2.2 


8.65 


13,5 


113% 


AN53-2.3 


Nd 






AN53-3.1 


Nd 






AN53-3.2 


Nd 






AN53-3.3 


Nd 






AN83-1.1 


10,6 






AN83-1.2 


15,7 






AN83-1.3 


13,8 






AN83-2.1 


8.7 






AN83-2.2 


7,72 


11,9 


100% 


AN83-2.3 


10.1 






AN83-3.1 


14.1 






AN83-3.2 


Nd 






AN83-3.3 


14,6 







EXAMPLE 2 

The mrgA gene was amplified by PCR from the chromosome of S. subtilis by use of 
primers p740mrgaF2 (SEQ ID NO: 8) and MBmrgaR2 (SEQ ID NO: 5). The upstream primer 
(p740mrgaF2) comprised the constitutive synthetic promoter P740 (SEQ ID NO: 9). The PCR 
product (SEQ ID NO: 10) was cut with restriction enzymes Sfil and BamHl and ligated to the 
large Sffl-BamHI fragment of pDG268neo, resulting in plasmid p740mrgA. The ligation-mixture 
of p740mrgA, described in the Material and Methods section, was introduced by transformation 

jnto ine a. suowis 168 strain, ana tne iransTormants were plated on LBPSK media 
supplemented with 6 microg/ml chloramphenicol (cam) to select for integrants. Transformants 

. growing on these plates have integrated the plasmid in the amy locus, either by a single (Cam* 
kan + ) or double cross-over event (cam*kan~). The transformants were re-streaked on 
LBPSK/cam media with and without 20 microg/ml kanamycine. Strains where double cross- 
over events had occurred were carn + kan\ These strains no longer showed the tell-tale clearing 
zones; which signified that that integration in, and disruption of, the amy gene, had taken 
place. The site of integration and the mrgA sequence was verified by PCR and sequence 
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analysis. The resulting strain was transformed with plasmid pKTHIO which constitutively 
express the alpha-amyiase AmyQ of Bacillus amyloliquefaciens. The resulting strain was 
named AN55. Yields of amylase from AN55 were determined in triplicate from three 
independent isolates, and compared to yield of amylase from the control strain AN83. Results 
5 are shown in table 2; the AN55 strain constitutively expressing mrgA from the synthetic 
promoter has an increased alpha-amylase yield, which on average is 21% higher than the 
control strain, AN83, which only comprises a wild-type copy of the mrgA gene. 

Table 2. Yields of amylase from AN55 were determined in triplicate from 
10 three independent isolates, and compared to the yield of amylase from the 
control strain AN83. The average yields of each strain are also shown. Nd: 
Not determined. 



Strain 


Amylase activity 
(relative) 


average 


average 
(in %) 


AN55-1.1 


11.5 


14,4 


121% 


AN55-1.2 


10,8 


AN55-1.3 


13.3 


AN55-2.1 


17,4 


AN55-2.2 


17,3 


AN55-2.3 


15,6 


AN55-3.1 


11.2 


AN55-3.2 


14.8 


AN55-3.3 


18.1 


AN83-1.1 


10,6 


11.9 


100% 


AN83-1.2 


15,7 


AN83-1.3 


13,8 


AN83-2.1 


8.7 


AN83-2.2 


7.72 


AN83-2.3 


10,1 


AN83-3.1 


14,1 


AN83-3.2 


Nd 


AN83-3.3 


14.6 



15 EXAMPLE 3 

The mrgA gene was amplified by PCR from the chromosome of B. subtilis by use of 
primers p726mrgaF2 (SEQ iu inu: i i ; anu iviornrgaR2 (SEQ iD NO: 5). The upstream primer 
(p726mrgaF2) comprised the constitutive synthetic promoter P726 (SEQ ID NO: 12). The PCR 
a product (SEQ ID NO: 13) was cut with restriction enzymes Sffl and BamHI and ligated to the 
20 large Sffl-Ba/nHI fragment of pDG268neo, resulting in plasmid p726mrgA. The ligation-mixture 
of p726mrgA, described in the Material and Methods section, was introduced by transformation 
into the B. subtilis 168 strain, and the transformants were plated on LBPSK media 
supplemented with 6 microg/ml chloramphenicol to select for integrants. Transformants 
growing on these plates have integrated the plasmid in the amy locus, either by a single (Cam* 
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kan*) or double cross-over event (cam + kerf). The transformants were then re-streaked on 
LBPSK/cam media with and without 20 microg/ml kanamycine. Strains where double cross- 
over events had occurred were cam* kan\ These strains no longer showed the tell-tale clearing 
zones; which signified that that integration in, and disruption of, the amy gene, had taken 
place. The site of integration and the mrgA sequence was verified by PCR and sequence 
analysis. The resulting strain was then transformed with plasmid pKTMO, which constitutively 
express the alpha-amylase AmyQ of Bacillus amyloliquefaciens. The resulting strain was 
named AN57. Yields of amylase from AN57 were determined in triplicate from three 
independent isolates, and compared to the yield of amylase from the control strain AN83. 
Results are shown in table 3; the AN55 strain constitutively expressing mrgA from the synthetic 
promoter has an increased alpha-amylase yield, which on average is 40% higher than the 
control strain, AN83, which only comprises a wild-type copy of the mrgA gene. 

Table 3. Yields of amylase from AN57 were determined in triplicate from 
three independent isolates, and compared to the yield of amylase from the 
control strain AN83. The average yields of each strain are also shown. Nd: 
Not determined. 



Strain 


Amylase activity 
(relative) 


average 


average 
(in %) 


AN57-1.1 


15,5 


16,6 


140% 


AN57-1.2 


10.6 


AN57-1.3 


17,3 


AN57-2.1 


17,4 


AN57-2.2 


20,7 


AN57-2.3 


13,9 


AN57-3.1 


15.1 


AN57-3.2 


17 


AN57-3.3 


22 


AN83-1.1 


10.6 


11,9 


100% 


AN83-1.2 


15.7 


AN83-1.3 


13.8 


AN83-2.1 


8.7 


AN83-2.2 


7.72 


AN83-2.3 


10.1 


AN83-3.1 


14,1 


AN83-3.2 


Nd 


AN83-3.3 


14,6 
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CLAIMS 

1 . A progeny cell derived from a parent cell, wherein 

a) the progeny cell comprises at least one gene encoding MrgA protein or a functional 
homologue thereof and/or a DNA segment operably linked with the encoding gene, 

5 wherein said gene and/or DNA segment is manipulated with respect to the parent cell; 

b) the progeny cell comprises two or more copies of a gene encoding MrgA protein or a 
functional homologue thereof; or 

c) the progeny cell is mutated with respect to the parent cell; 

whereby the progeny cell produces greater amounts of MrgA protein or a functional homologue 
10 thereof than the parent cell. 

2. The cell of claim 1 , which secretes greater amounts of an exoprotein of interest than 
the parent cell, 

15 3. The cell of claims 1 or 2, which is a bacterial cell, preferably a prokaryotic cell, more 
preferably a Gram-positive cell, and most preferably of the genus Bacillus. 

4. The cell of claim 3, which is of a species chosen from the group consisting of Bacillus 
alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus coagulans, 

20 Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus stearothermophilus, Bacillus 
subtilis, and Bacillus thuringiensis. 

5. The cell of any of claims 2-4, wherein said exoprotein of interest is homologous or 
heterologous. 

25 

6. The cell of any of claims 2-5, wherein said exoprotein is a protease, a lipase, a 
cutinase, an amylase, a galactosidase, a pullulanase, a cellulase, a glucose isomerase, a 
protein disulphide isomerase, a CGTase (cyclodextrin gluconotransferase), a phytase, a 
glucose oxidase, a glucosyl transferase, lactase, bilirubin oxidase, a xylanase, an antigenic 

30 microbial or protozoan protein, a bacterial protein toxin, a microbial surface protein, or a viral 
protein. 

7. The cell of any of claims 1-6, wherein the MrgA protein or functional homologue thereof 
comprises an amino acid sequence which is at least 70% identical to the amino sequence 

35 shown in SEQ ID NO:2, preferably at least 75%, 80%, 85%, 90%, 95%, 97%, or even 99% 
identical to the amino sequence shown in SEQ ID NO:2. 

25 



8. The cell of any of claims 1-7, wherein the MrgA protein or functional homologue thereof 
comprises the amino acid sequence shown in SEQ ID NO:2. 

9. The cell of any of claims 1-8, which comprises at least one exogenous copy of a 
polynucleotide encoding MrgA protein or a functional homologue thereof comprising an amino 
acid sequence which is at least 70% identical to the amino sequence shown in SEQ ID NO:2, 
preferably at least 75%, 80%, 85%, 90%, 95%, 97%, or even 99% identical to the amino 
sequence shown in SEQ ID NO:2. 

10. The cell of any of claims 1-9, which comprises at least one exogenous copy of a 
polynucleotide encoding MrgA protein or a functional homologue thereof comprising the amino 
acid sequence shown in SEQ ID NO:2. 

1 1 . The cell of any of claims 1-10, which comprises at least one exogenous copy of a 
polynucleotide, which: 

a) comprises a polynucleotide sequence which is at least 70% identical to the sequence 
shown in SEQ ID NO:1; preferably at least 75%, 80%, 85%, 90%, 95%, 97%, or even 
99% identical to the sequence shown in SEQ ID NO:1; or 

b) hybridizes with the sequence shown in SEQ ID NO:1, under medium stringency 
conditions, preferably under medium-high stringency, and more preferably under high 
stringency conditions. 

12. The cell of any of claims 1-11, wherein at least one exogenous copy of a gene 
encoding the MrgA protein or a functional homologue thereof is transcribed from one or more 
heterologous and/or artificial promoter. 

13. The cell of any of claims 1-12, wherein at least one exogenous copy of a gene 
encoding the MrgA protein or a functional homologue thereof is integrated into the genome of 
the cell. 

14. The cell of any of claims 1-12, wherein at least one exogenous copy of a gene 
encoding the MrgA protein or a functional homologue thereof is present on an extra- 
chromosomal construct, preferably a plasmid. 

15. A method for enhancing secretion of an exoprotein of interest, the method comprising 
expressing said exoprotein in a cell according to any of the preceding claims. 
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16. A method for producing a cell as defined in any of claims 1-14 useful for production of 
an exoprotein of interest, said method comprising a step of manipulating a cell to increase the 
expression of MrgA protein or functional homologue thereof. 

5 1 7. The method of claim 16, wherein the cell secretes greater amounts of an exoprotein of 
interest than the non-manipulated parent cell. 

18. The method of claims 16 or 17, wherein said method comprises the steps of: 

a) identifying a gene from the parent cell that encodes MrgA protein or a functional 
10 homologue thereof; and 

b) manipulating the cell to increase the expression of the gene identified in step (a), whereby 
the manipulated progeny cell expresses greater amounts of MrgA protein or functional 
homologue thereof, than the non-manipulated cell. 

15 19. The method of any of claims 1 6-1 8, wherein the cell is a bacterial cell, preferably the 
cell is a prokaryotic cell, more preferably a Gram-positive cell, and most preferably the cell is of 
the genus Bacillus. 

20. The method of claim 1 9, wherein the cell is of a species chosen from the group 
20 consisting of Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus 

circulans, Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus 
stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis. 

21 . The method of any of claims 16-20, wherein said exoprotein of interest is a protease, a 
25 lipase, a cutinase, an amylase, a galactosidase, a pullulanase, a cellulase, a glucose 

isomerase, a protein disuphide isomerase, a CGT'ase (cyclodextrin gluconotransferase), a 
phytase, a glucose oxidase, a glucosyl transferase, lactase, bilirubin oxidase, a xylanase, an 
antigenic microbial or protozoan protein, a bacterial protein toxin, a microbial surface protein, 
or a viral protein. 

30 

22. A method for producing an exoprotein of interest, comprising the steps of: 

a) cultivating a cell as defined in any of claims 2-14; and 

b) recovering the exoprotein. 

35 23. The method of claim 22, wherein the cell is a bacterial cell, preferably the cell is a 
prokaryotic cell, more preferably a Gram-positive cell, and most preferably the cell is of the 
genus Bacillus. 
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24. The method of claim 23, wherein the cell is of a species chosen from the group 
consisting of Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus 
circulans, Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus 
stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis. 

5 

25. The method of any of claims 22-24, wherein said exoprotein of interest is a protease, a 
lipase, a cutinase, an amylase, a galactosidase, a pullulanase, a cellulase, a glucose 
isomerase, a protein disuphide isomerase, a CGT'ase (cyclodextrin gluconotransferase), a 
phytase, a glucose oxidase, a glucosyl transferase, lactase, bilirubin oxidase, a xylanase, an 

10 antigenic microbial or protozoan protein, a bacterial protein toxin, a microbial surface protein, 
or a viral protein. 

26. Use of MrgA-protein or a functional homologue thereof in a method for enhancing 
secretion of an exoprotein by manipulating or mutating a cell to express greater amounts of 

15 MrgA protein or functional homologue thereof than the non-manipulated or non-mutated cell. 
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ABSTRACT 

A progeny cell derived from a parent cell, wherein the progeny cell comprises at least 
one gene encoding MrgA protein or a functional homologue thereof and/or a DNA segment 
operably linked with the encoding gene, wherein said gene and/or DNA segment is 
5 manipulated with respect to the parent cell; the progeny cell comprises two or more copies of 
a gene encoding MrgA protein or a functional homologue thereof; or the progeny cell is 
mutated with respect to the parent cell; whereby the progeny cell produces greater amounts of 
MrgA protein or a functional homologue thereof than the parent cell. 
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Novozymes a/S 



A cell with improved secretion mediated by MrgA protein or 
homologue 

10527 . 000-DK 

13 

Patentln version 3.2 
1 

815 
ONA 

Bacillus subtil is 168 



Patent- og 
Varemeerkestyrelsen 

1 0 DEC. 2003 
Modtaget 



<220> 
<221> 
<222> 
<223> 



mi sc_feature 
(201) . . (659) 
MrgA encoding sequence 



<400> 1 

aagaatttgc gatacccgat cggaaagggc atcaagctca ccctgctgtt ccgatcgctt 
tttccttggt ctgcgtggga gtctatcctg aagaaaaagc tattcagctg atctaaatta 
taattattat aatttagtat tgatttttat ttagtatatg atataattaa gtcaacagat 
cacaaggagg acgttatctt atgaaaactg aaaacgcaaa aacaaatcaa acattagttg 
agaattcact gaacacacaa ttatcaaact ggtttctttt atactctaag ctccaccgtt 
tccattggta tgtgaaaggg cctcatttct ttacattgca cgagaaattt gaagaacttt 
atgaccatgc ggctgaaaca gtggatacca tcgctgagcg cctgctggcg attggcggac 
agcctgttgc cacagtgaaa gaatacactg agcatgcatc tatcacagac ggcggaaacg 
aaacatcagc atcagaaatg gtacaagcat tggtaaacga ctacaaacaa atcagcagcg 
aatctaaatt cgtgatcggc ctggctgaag aaaatcaaga caatgcgaca gcggacttgt 
ttgtcggatt aattgaagaa gttgaaaaac aagtgtggat gctttcctct tatttagggt 
aacaaaaaag ctgaacctta atcgggttca gctttttgtt ttttcttagc ttgaactgct 
ttctgtctgc ttggtcagtg ttgcgttcaa cgttttcgtt tttccctttc gcagcacttg 
gattgttgtt ttatctccga cttttaagtc tttgt 

<210> 2 

<211> 153 

<212> PRT 

<213> Bacillus subtilis 168 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
815 



<220> 

<221> PEPTIDE 

<222> (1) . . (153) 

<223> MrgA protein 

<400> 2 

Met Lys Thr Glu Asn Ala Lys Thr Asn Gin Thr Leu val Glu Asn ser 
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15 10 15 

Leu Asn Thr Gin Leu Ser Asn Trp Phe Leu Leu Tyr Ser Lys Leu His 
20 25 30 

Arg Phe His Trp Tyr Val Lys Gly Pro His Phe Phe Thr Leu His Glu 
35 40 45 

Lys Phe Glu Glu Leu Tyr Asp His Ala Ala Glu Thr val Asp Thr lie 
50 55 60 

Ala Glu Arg Leu Leu Ala He Gly Gly Gin Pro val Ala Thr val Lys 
65 70 75 80 

Glu Tyr Thr Glu His Ala ser He Thr Asp Gly Gly Asn Glu Thr ser 
85 90 95 

Ala Ser Glu Met Val Gin Ala Leu val Asn Asp Tyr Lys Gin lie Ser 
100 105 110 

Ser Glu ser Lys Phe val He Gly Leu Ala Glu Glu Asn Gin Asp Asn 
115 120 125 

Ala Thr Ala Asp Leu Phe val Gly Leu lie Glu Glu val Glu Lys Gin 
130 135 140 

val Trp Met Leu ser Ser Tyr Leu Gly 
145 150 



<210> 3 
<211> 8644 
<212> DNA 

<213> Artificial sequence 








<220> 

<223> Plasmid pDG268neo 








<400> 3 
aacaaaattc 


tccagtcttc 


acatcggttt 


gaaaggagga agcggaagaa 


tgaagtaaga 


60 


gggatttttg 


actccgaagt 


aagtcttcaa 


aaaatcaaat aaggagtgtc 


aagaatgttt 


120 


gcaaaacgat 


tcaaaacctc 


tttactgccg 


ttattcgctg gatttttatt 


gctgtttcat 


180 


ttggttctgg 


caggaccggc 


ggctgcsss*- 


Z^Z ZZ cgaacaaatc 


gaatgagctt 


240 


acagcaccgt 


cgatcaaaag 


cggaaccatt 


cttcatgcat ggaattggtc 


gttcaatacg 


300 


ttaaaacaca 


atatgaagga 


tattcatgat 


gcaggatata cagccattca 


gacatctccg 


360 


attaaccaag 


taaaggaagg 


gaatcaagga 


gataaaagca tgtcgaactg 


gtactggctg 


420 


tatcagccga 


catcgtatca 


aattggcaac 


cgttacttag gtactgaaca 


agaatttaaa 


480 


gaaatgtgtg 


cagccgctga 


agaatatggc 


ataaaggtca ttgttgacgc 


gcggccgcgg 


540 


atccatacac 


aaaaaaacgc 


tgtgcccttt 


aaccgcacag cgttttttta 


ttgattaacg 


600 
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cgttgccgct 


tctgcgttaa 


caagtccgct 


tccatacaag ttcgtgcttc 


ctaaactagt 


660 


tgccgtattc 


tttagatgat 


ttcgaatttg 


tacattagac caagatgggt 


tcttttgttt 


720 


aacaagggcg 


gccgcacctg 


caacatgagg 


agtagccatc gatgtaccgt 


ttaagctggc 


780 


atatgttgaa 


cctgggtatg 


tgctctgcac 


gtttaccccg ggtgcgacaa 


tgtcaaggcc 


840 


tgcgccatac 


tgtgaaaagc 


tagcgcggtt 


gttgttttga tcagtagctc 


cgactgccat 


900 


tgcgttcgca 


tagcgcgccg 


gatagctgat 


tgagcctgca cctgaattcc 


cagatgccgc 


960 


tacaacaaga 


acgcctctag 


aagtcgcgct 


attaacagct tgctcgagtg 


tggcacttgg 


1020 


cgaagggctt 


cctaaactca 


aattagcaac 


gtgcatgcca ttgttccctg 


cccattccaa 


1080 


tccttgggca 


atcgagctga 


ccgaacctga 


accgctcgcc cctaggactt 


taacagcgta 


1140 


tagctcagcg 


ctcggcgcta 


cgccaagaac 


gccaatcgaa ttgtttaaag 


cagcgatcgt 


1200 


cccggccaca 


tgcgtgccat 


gcccattccc 


atcttgagtc gacggttccc 


ctggtacaaa 


1260 


gcttgcgcca 


ccacgaatat 


ttagatctgg 


atgagtggat atccctgtat 


cgaggacagc 


1320 


aacttttaca 


ccagaacctg 


tcaatccacg 


gttatgggca gctggggctt 


gcacacggct 


1380 


aattccccat 


ggtaccgatt 


gcgccattgt 


cgttacttct gcatcctctt 


caatataaga 


1440 


aatcgctgga 


tcgagttcaa 


gcgcgtccac 


atcttctggg cttaactcaa 


cggataaaac 


1500 


aggaatcgtt 


tcaaattcat 


gaagcaattc 


aatttcgact tcctcttcct 


cagagagaat 


1560 


ggcgacctcg 


tcatttgcct 


ctacttgttc 


tacaaactca ctgacagctt 


cctgctcatt 


1620 


aaagccaatt 


aaatattttt 


cttttgcttc 


ttcagcagcc gatgcgatcg 


atgaactaaa 


1680 


agcaacagaa 


atgagtagtg 


cggtgcttgc 


gacaattttc cccaacggtt 


tcttcattcg 


1740 


gtttccctcc 


tcatttttat 


agagctccat 


aatacataat tttcaaactg 


ataaaatgat 


1800 


ttttcataaa 


tccattagac 


ggtgcaaata 


tatgttttta atgttcttcg 


tttttaggca 


1860 


tccctccttt 


caagataaat 


aatttataca 


ctattctatt ggaatcttaa 


tcattccaat 


1920 


agaaaaatat 


gtaatgatta 


taaataagtc 


gcttcttatc ataaatatat 


ttacatattc 


1980 


atttaatact 


acatcatgtt 


aggtatagta 


aggctatcaa gggtgtctta 


atttctactt 


2040 


gtaacaatgt 


attggcatat 


tatatattga 


attgagaaaa ttaaatacag 


cgataattca 


2100 


catgaacaag 


ttcattggta 


gttatatttt 


caaattttca aggttgtgct 


tgtatgtcat 


2160 


tctatagtta 


gataagcatt 


tgaggtagag 


tccgtccgaa tatatttgta 


atctgaagaa 


2220 


ggttcaaaca 


tatttctata 


taacgtattc 


tttttttgta gttcttactt 


ttgaggggcg 


2280 


ttacaattca 


aagatattat 


ctttaattaa 


gcttaacatt aataattctt 


caattgcaac 


2340 


aaaaaaagca 


cttttatcta 


aggtttcatc 


ttacgtttcg agggcccctc 


cattttctta 


2400 


tacaaattat 


attatacata 


tcagtaaaat 


aatgtcaacc cccctttatt 


cctttttttt 


2460 


acacagcgga 


cagtctggac 


agcaggccct 


taaggccaat tctcatgttt 


gacagcttat 


2520 


catcggcaat 


agttaccctt 


attatcaaga 


taagaaagaa aaggatttrt 


cgctacgctc 


2580 


aaatccttta 


aaaaaacaca 


aaagaccaca 


ttttttaatg tggtctttat 


tcttcaacta 


2640 
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aagcacccat tagttcaaca aacgaaaatt ggataaagtg ggatattttt aaaatatata 2700 

tttatgttac agtaatattg acttttaaaa aaggattgat tctaatgaag aaagcagaca 2760 

agtaagcctc ctaaattcac tttagataaa aatttaggag gcatatcaaa tgaactttaa 2820 

taaaattgat ttagacaatt ggaagagaaa agagatattt aatcattatt tgaaccaaca 2880 

aacgactttt agtataacca cagaaattga tattagtgtt ttataccgaa acataaaaca 2940 

agaaggatat aaattttacc ctgcatttat tttcttagtg acaagggtga taaactcaaa 3000 

tacagctttt agaactggtt acaatagcga cggagagtta ggttattggg ataagttaga 3060 

gccactttat acaatttttg atggtgtatc taaaacattc tctggtattt ggactcctgt 3120 

aaagaatgac ttcaaagagt tttatgattt atacctttct gatgtagaga aatataatgg 3180 

ttcggggaaa ttgtttccca aaacacctat acctgaaaat gctttttctc tttctattat 3240 

tccatggact tcatttactg ggtttaactt aaatatcaat aataatagta attaccttct 3300 

acccattatt acagcaggaa aattcattaa taaaggtaat tcaatatatt taccgctatc 3360 

tttacaggta catcattctg tttgtgatgg ttatcatgca ggattgttta tgaactctat 3420 

tcaggaattg tcagataggc ctaatgactg gcttttataa tatgagataa tgccgactgt 3480 

actttttaca gtcggttttc taatgtcact aacctgcccc gttagttgaa gaaggttttt 3540 

atattacagc tccagatcct ctacgccgga cgcatcgtgg ccggcatcac cggcgccaca 3600 

ggtgcggttg ctggcgccta tatcgccgac atcaccgatg gggaagatcg ggctcgccac 3660 

ttcgggctca tgagcgcttg tttcggcgtg ggtatggtgg caggccccgt ggccggggga 3720 

ctgttgggcg ccatctcctt gcatgcacca ttccttgcgg cggcggtgct caacggcctc 3780 

aacctactac tgggctgctt cctaatgcag gagtcgcata agggagagcg tcgacatgga 3840 

tgagcgatga tgatatccgt ttaggctggg cggtgatagc ttctcgttca ggcagtacgc 3900 

ctcttttctt ttccagacct gagggaggcg gaaatggtgt gaggttcccg gggaaaagcc 3960 

aaataggcga tcgcgggagt gctttatttg aagatcaggc tatcactgcg gtcaatagat 4020 

ttcacaatgt gatggctgga cagcctgagg aactctcgaa cccgaatgga aacaaccaga 4080 

tatttatgaa tcagcgcggc tcacatggcg ttgtgctggc aaatgcaggt tcatcctctg 4140 

tctctatcaa tacggcaaca aaattgcctg atggcaggta tgacaataaa gctggagcgg 4200 

gttcatttca agtgaacgat ggtaaactga caggcacgat caatgccagg tctgtagctg 4260 

tgctttatcc tgatgatatt gcaaaagcgc ctcat^tttit ccttgagaat tacaaaacag 4320 

gtgtaacaca ttctttcaat gatcaactga cgattacctt gcgtgcagat gcgaatacaa 4380 

caaaagccgt ttatcaaatc aataatggac cagacgacag gcgtttaagg atggagatca 4440 

attcacaatc ggaaaaggag atccaatttg gcaaaacata caccatcatg ttaaaaggaa 4500 

cgaacagtga tggtgtaacg aggaccgaga aatacagttt tgttaaaaga gatccagcgt 4560 

cggccaaaac catcggctat caaaatccga atcattggag ccaggtaaat gcttatatct 4620 

ataaacatga tgggagccga gtaattgaat tgaccggatc ttggcctgga aaaccaatga 4680 
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ctaaaaatar 


oyav-y yaa u L 
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tacaegctga cgctgcctgc ggacaeggat acaaccaacg 


4740 


caaaaatoa t - 

^■fc* ****** \J l- y Q Tw 


ttttaataat 

L L L Lad Lad L 


ggcagcgccc 


aagtgcccgg 


tcagaatcag 


cctggctttg 


4800 


ai!*tacft.t*ftri- 


dddLygLLld 


tataatgact 


egggcttaag 


eggttctett 


ccccattgag 


4860 


y y »y y v. l a 


yauyyydL. ll 


accgaaagaa 


accatcaatg 


atggtttctt 


ttttgttcat 


4920 




dddLLLLLCL 


ettgeaaaag 


tttgtgaagt 


gttgcacaat 


ataaatgtga 


4980 


aa'tacttcar 


aaaraaaaan 
dddv.dddddy 


acatcaaaga 


gaaacatacc 


ctgeaaggat 


gctgatattg 


5040 




f*n c fn ft a ft/* a 
L»y v>L»yydyud 


aaccaaaaac 


ctggtgagac 


aegecttgaa 


ttagtagaaa 


5100 


aGaac*tt*aaa 


naftttraaa 

yd LLL LLddd 


ggcatcgtta 


gtgaagtcat 


ggcgagcgga 


tttgaeggea 


5160 


t "tttct tacit* 


fflfltaaraat 
wyy LaaLda u 


cctcgttaaa 


ggacaaggac 


ctgagcggaa 


gtgtatcgta 


5220 




dy Latdtidy 


tatagtctat 


agtccgtgga 


attattatat 


ttatctccga 


5280 


vy« La L L L* L v. 


dLLdtj igddd 


tccagctgga 


gttctttagc 


aaattttttt 


attagctgaa 


5340 




y uygggcege 


tgataattac 


taatactagg 


agaagttaat 


aaatacgtaa 


5400 




fa a raatta*)* 
LddLddLLdL 


tagaggtcat 


cgttcaaaat 


ggtatgcgtt 


ttgacacatc 


5460 


ra f*t*at"a*ha*l" 


wcy tg xegu t 


ctgtccactc 


ctgaatccca 


ttccagaaat 


tetctagega 


5520 


*t* 1* t* r a ft a a n *• 
l Lt_Ldyddy L 


llc xcagag x 


cggaaagttg 


accagacatt 


acgaactggc 


acagatggtc 


5580 


afanffiriJia 
aLaaWW lyaa 


ggaagaxcxg 


attgettaac 


tgcttcagtt 


aagaccgaag 


cgctcgtcgt 


5640 


a Ida Lalja Ly 


egaxgaxgea 


gaccaatcaa 


catggcacct 


gecattgeta 


cctgtacagt 


5700 


l a d y y d l y g l 


dgdadxgxxg 


teggtccttg 


cacacgaata 


ttacgecatt 


tgectgeata 


5760 


ftraaaranr 


xexxcxaega 


taagggcaca 


aategcateg 


tggaacgttt 


gggcttctac 


5820 




fltttnatara 
y LLLydLdCd 


ctttctctaa 


gtatccacct 


gaatcataaa 


teggcaaaat 


5880 


ananaaaaat 
ayayddddd L 


xyaccaxgxg 


taageggeca 


atctgattcc 


acctgagatg 


cataatctag 


5940 




Lcgcxaxcaa 


aattcacttc 


caccttccac 


tcaccggttg 


tccattcatg 


6000 


o r* t* n a a r* *t~ r* -t* 
y L. LyaaWLv, L 


y t xxccxcxg 


ttgacatgac 


acacatcatc 


tcaatatccg 


aatagggece 


6060 


a*t"r*a fit* f"*f*fia 


CydLLddydy 


agecataaac 


accaatagcc 


ttaacatcat 


ccccatattt 


6120 


atrraatatt 


cgxxccxxaa 


tttcatgaac 


aatcttcatt 


ctttcttctc 


tagtcattat 


6180 


t* a t* t* o n *fr c r* a 
La l Lyy iv.L,d 


LLCdttdLLC 


tcattccctt 


ttcagataat 


tttagatttg 


cttttctaaa 


6240 


taanaal'att 

i_ a ct^JdCt La L L 


tggagagcac 


cgttcttatt 


cagctattaa 


taactegtet 


tcctaagcat 


6300 


Catl^ *~ t* 


«.LTl.LCC.«aCL 


ttttgtxttg 


tccac**"^?^*? 


cccttgattt 


ttxatxtgaa 


~i g\ 
6360 


faaatnrtar 
Let a a Ly l LaL 


xaxxaggaca 


cataatatta 


aaagaaaccc 


ccatctattt 


agttatttgt 


6420 


t* i* a /i "J* zk-f^Qr- 
l Lay tcav tl 


d Lddcxxxaa 


cagatggggt 


ttttctgtgc 


aaccaatttt 


aagggttttc 


6480 


aatactttaa 


aacacataca 


taccaacact 


tcaacgcacc 


tttcagcaac 


taaaataaaa 


6540 


atgacgttat 


ttctatatgt 


atcaagataa 


gaaagaacaa 


gttcaaaacc 


atcaaaaaaa 


6600 


gacacctttt 


caggtgcttt 


ttttatttta 


taaactcatt 


ccctgatctc 


cccatactcc 


6660 


tccaatccaa 


agctatttag 


aaagatrtact 


atatcctcaa 


acaggeggta 


accggcctct 


6720 
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tcatcgggaa 


tgcgcgcgac 


cttcagcatc 


gccggcatgt 


ccccctggcg 


gacgggaagt 


6780 


atccagctcg 


aggtcgggcc 


gcgttgctgg 


cgtttttcca 


taggctccgc 


ccccctgacg 


6840 


agcatcacaa 


aaatcgacgc 


tcaagtcaga 


ggtggcgaaa 


cccgacagga 


ctataaagat 


6900 


accaggcgtt 


tccccctgga 


agctccctcg 


tgcgctctcc 


tgttccgacc 


ctgccgctta 


6960 


ccggatacct 


gtccgccttt 


ctcccttcgg 


gaagcgtggc 


gctttctcat 


agctcacgct 


7020 


gtaggtatct 


cagttcggtg 


taggtcgttc 


gctccaagct 


gggctgtgtg 


cacgaacccc 


7080 


ccgttcagcc 


cgaccgctgc 


gccttatccg 


gtaactatcg 


tcttgagtcc 


aacccggtaa 


7140 


gacacgactt 


atcgccactg 


gcagcagcca 


ctggtaacag 


gattagcaga 


gcgaggtatg 


7200 


taggcggtgc 


tacagagttc 


ttgaagtggt 


ggcctaacta 


cggctacact 


agaaggacag 


7260 


tatttggtat 


ctgcgctctg 


ctgaagccag 


ttaccttcgg 


aaaaagagtt 


ggtagctctt 


7320 


gatccggcaa 


acaaaccacc 


gctggtagcg 


gtggtttttt 


tgtttgcaag 


cagcagatta 


7380 


cgcgcagaaa 


aaaaggatct 


caagaagatc 


ctttgatctt 


ttctacgggg 


tctgacgctc 


7440 


agtggaacga 


aaactcacgt 


taagggattt 


tggtcatgag 


attatcaaaa 


aggatcttca 


7500 


cctagatcct 


tttaaattaa 


aaatgaagtt 


ttaaatcaat 


ctaaagtata 


tatgagtaaa 


7560 


cttggtctga 


cagttaccaa 


tgcttaatca 


gtgaggcacc 


tatctcagcg 


atctgtctat 


7620 


ttcgttcatc 


catagttgcc 


tgactccccg 


tcgtgtagat 


aactacgata 


cgggagggct 


7680 


taccatctgg 


ccccagtgct 


gcaatgatac 


cgcgagaccc 


acgctcaccg 


gctccagatt 


7740 


tatcagcaat 


aaaccagcca 


gccggaaggg 


ccgagcgcag 


aagtggtcct 


gcaactttat 


7800 


ccgcctccat 


ccagtctatt 


aattgttgcc 


gggaagctag 


agtaagtagt 


tcgccagtta 


7860 


atagtttgcg 


caacgttgtt 


gccattgctg 


caggcatcgt 


ggtgtcacgc 


tcgtcgtttg 


7920 


gtatggcttc 


attcagctcc 


ggttcccaac 


gatcaaggcg 


agttacatga 


tcccccatgt 


7980 


tgtgcaaaaa 


agcggttagc 


tccttcggtc 


ctccgatcgt 


tgtcagaagt 


aagttggccg 


8040 


cagtgttatc 


actcatggtt 


atggcagcac 


tgcataattc 


tcttactgtc 


atgccatccg 


8100 


taagatgctt 


ttctgtgact 


ggtgagtact 


caaccaagtc 


attctgagaa 


tagtgtatgc 


8160 


ggcgaccgag 


ttgctcttgc 


ccggcgtcaa 


cacgggataa 


taccgcgcca 


catagcagaa 


8220 


ctttaaaagt 


gctcatcatt 


ggaaaacgtt 


cttcggggcg 


aaaactctca 


aggatcttac 


8280 


cgctgttgag 


atccagttcg 


atgtaaccca 


ctcgtgcacc-caactgatct 


tcagcatctt 


8340 


ttactttcac 


cagcgtttct 


gggt£^c~ 




:^caaaatgcc 


gcaaaaaagg 


8400 


gaataagggc 


gacacggaaa 


tgttgaatac 


tcatactctt 


cctttttcaa 


tattattgaa 


8460 


gcatttatca 


gggttattgt 


ctcatgagcg 


gatacatatt tgaatgtatt 


tagaaaaata 


8520 


aacaaatagg 


ggttccgcgc 


acatttcccc 


gaaaagtgcc acctgacgtc 


taagaaacca 


8580 


ttattatcat 


gacattaacc 


tataaaaata 


ggcgtatcac gaggcccttt 


cgtcttcaag 


8640 


aatt 












8644 
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<211> 91 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer p920mrgaF2 
<400> 4 

ctgaggccaa ttaggccaag tttattcttg acattaggga acatgcatga tataataggt 60 
aaagtaaaca gatcacaagg aggacgttat c 91 



<210> 5 

<211> 34 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer MBmrgaR2 

<400> 5 

tgaaggatcc acgcgtccag cagacagaaa gcag 



34 



<210> 6 

<211> 49 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Promoter P920 

<400> 6 

aagtttattc ttgacattag ggaacatgca tgatataata ggtaaagta 49 



<210> 7 
<211> 642 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR product of promoter 920 and mrgA 
<400> 7 

ctgaggcctt aagggccaag tttattcttg acattaggga acatgcatga tataataggt 60 
aaagtaaaca gatcacaagg aggacgttat cttatgaaaa ctgaaaacgc aaaaacaaat 120 
caaacattag ttgagaattc actgaacaca caattatcaa actggtttct tttatactct 180 
aagctccacc gtttccattg gtatgtgaaa gggcctcatt tctttacatt gcacgagaaa 240 
tttgaagaac tttatgacca tgcggctgaa acagtggata ccatcgctga gcgcctgctg 300 
gcgattggcg gacagcctgt tgccacagtg aaagaataca ctgagcatgc atctatcaca 360 
gacggcggaa acgaaacatc agcatcagaa atggtacaag cattggtaaa cgactacaaa 420 
caaatcagca gcgaatctaa attcgtgatc ggcctggctg aagaaaatca agacaatgcg 480 
acagcggact tgtttgtcgg attaattgaa gaagttgaaa aacaagtgtg gatgctttcc 540 
tcttatttag ggtaacaaaa aagctgaacc ttaatcgggt tcagcttttt gttttttctt 600 
agcttgaact gctttctgtc tgcutgacgc gtggatcctt ca 642 
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<210> 8 
<211> 91 

<213> Artificial sequence 
<223^ Primer p740mrgaF2 

Sgtggclaa ttaggcccgg aag«tg«g acacagctcc aggatacaaa tataatgggt 60 



cgactaaaca gatcacaagg aggacgttat c 



<210> 9 

<211> 49 

J213> Artificial sequence 

<223> Promoter P740 



49 

c^lagt?tg ttgacacagc tccaggatac aaatataatg ggtcgagta . 

<210> 10 
<211> 642 

<213> S?tificial sequence 

<223> PCR product of promoter 740 and mrgA 

cJgtggcSt aagggcccgg aagtttgttg acacagctcc aggatacaaa tataatgggt 60 
cgagtaaaca gatcacaagg aggacgttat cttatgaaaa ctgaaaacgc aaaaacaaat 
caaacattag ttgagaattc actgaacaca caattatcaa actggtttct tttatactct 
aagctccacc gtttccattg gtatgtgaaa gggcctcatt tctttacatt gcacgagaaa 
tttgaagaac tttatgacca tgcggctgaa acagtggata ccatcgctga gcgcctgctg 
gcgattggcg gacagcctgt tgccacagtg aaagaataca ctgagcatgc atctatcaca 
gacggcggaa acgaaacatc agcatcagaa atggtacaag cattggtaaa cgactacaaa 
caaatcagca gcgaatctaa attcgtgatc ggcctggctg aagaaaatca agacaatgcg 
acagcggact tgtttgtcgg attaattgaa gaagttgaaa aacaagtgtg gatgctttcc 
tcttatttag ggtaacaaaa aagctgaacc ttaatcgggt tcagcttttt gttttttctt 
agcttgaact gctttctgtc tgcttgacgc gtggatcctt ca 



<21"0> 11 

<211> 91 

<213> Artificial sequence 

<223> Primer p726mrgaF2 

< 400 > 11 Page8 



120 
180 
240 
300 
360 
420 
480 
540 
600 
642 
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ctgaggccaa ttaggccgag gtgagatttg acactagtag gctacgggac tataatgcgg 6C 

gaagtaaaca gatcacaagg aggacgttat c 91 

<210> 12 

<211> 49 

<212> ONA 

<213> Artificial sequence 
<220> 

<223> Promoter P726 

<400> 12 

gaggtgagat ttgacactag taggctacgg gactataatg cgggaagta 49 

<210> 13 
<211> 642 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> pcr product of promoter P726 and mrgA 
<400> 13 

ctgaggcctt aagggccgag gtgagatttg acactagtag gctacgggac tataatgcgg 60 
gaagtaaaca gatcacaagg aggacgttat cttatgaaaa ctgaaaacgc aaaaacaaat 120 
caaacattag ttgagaattc actgaacaca caattatcaa actggtttct tttatactct 180 
aagctccacc gtttccattg gtatgtgaaa gggcctcatt tctttacatt gcacgagaaa 240 
tttgaagaac tttatgacca tgcggctgaa acagtggata ccatcgctga gcgcctgctg 300 
gcgattggcg gacagcctgt tgccacagtg aaagaataca ctgagcatgc atctatcaca 360 
gacggcggaa acgaaacatc agcatcagaa atggtacaag cattggtaaa cgactacaaa 420 
caaatcagca gcgaatctaa attcgtgatc ggcctggctg aagaaaatca agacaatgcg 480 
acagcggact tgtttgtcgg attaattgaa gaagttgaaa aacaagtgtg gatgctttcc 540 
tcttatttag ggtaacaaaa aagctgaacc ttaatcgggt tcagcttttt gttttttctt 600 
agcttgaact gctttctgtc tgcttgacgc gtggatcctt ca 642 
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